Corpus: bul_newscrawl_2015_30K

Other corpora

2.2.5 Most frequent word beginnings

The most frequent word beginnings as character N-grams for N=1...5 with Zipf's diagram


Zipf's diagram for word beginnings


Gnuplot diagram

Top Characters
word rank frequency n-gram
1 8878 п-
2 4961 с-
3 3747 н-
4 3486 о-
5 3141 к-
Top Character Bigrams
word rank frequency n-gram
1 3716 пр-
2 3197 по-
3 2130 на-
4 1869 из-
5 1869 за-
Top Character Trigrams
word rank frequency n-gram
1 1447 пре-
2 1142 про-
3 1047 раз-
4 828 при-
5 623 под-
Top Character 4-Grams
word rank frequency n-gram
1 509 пред-
2 456 най--
3 189 разп-
4 163 пост-
5 133 прес-
Top Character 5-Grams
word rank frequency n-gram
1 107 предп-
2 99 предс-
3 82 благо-
4 79 прост-
5 72 произ-
575 msec needed at 2018-02-04 18:46